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i\bs trac t 


The SARl-E program is a prototype of a program which is intended to be 
used as an adjunct to regular classroom work in freshman calculus* Using 
SARGE, Students odd type their step-by-step solution to an indefinite 
integration problem, and can have the correctness of their solutions deter¬ 
mined by thd system* The syntax for these Steps comes quite close to normal 
mathematical notation, given the limitations of typewriter input. The 
method of solution is pretty much unrestrioted as long is no mistakes are 
made along the way. If a mistake is made, SARGE will catch it and yield an 
error message. The student may nodffy the incorrect step, Or he may ask 
the program for advice on how tile mistake arose by typing ’lie Ip”* At present 
the program is weak in generating explanations for mistakes* Sometimes the 
"help 11 mechanise will Just yield a response which will indicate the way in 
Which the erroneous step can he corrected► In order to improve the explana¬ 
tion mechanism one would need a sophisticated analysis of Students' solutions 
to homework or quiz problems. Experience with the behavior of students with. 
SARGE, which is nil at present, should also help in accomplishing this goal* 
SARGE is available as SARGE SAV^D in T30C 251?* 



Introduction 


Our ain in this project usa to use the programs available at jtf-J.T. 
in the area Of algebraic manipulation in order to produce a sophisticated 
teaching pro gran for freshcsn calculus students ^ As a matter of fact, ue 
'-etc not interested in teaching symbolic or indefinite integration as such, 
since the ncncal courses performed this function quite well, We were 
interested in a program which could he used to supplement regular instruc¬ 
tion with supervised practice sessions* Normally, human teaching assistants 
would be used for this purpose. Thus our goal can be said to he a simulation 
of the interaction of a student And a teaching Assistant. In order to 
capture certain aspectA of such An intoruction two major design eon*tderations 
were imposed- 

1) Freshman calculus Students are not computer programmers. Therefore 
the communication with the teaching system should he as natural as possible* 

2) There should h« no impediment to unusual solutions of a problem. 

Such unusual solutions include ingenious solutions as well as solutions 
with superfluous or erroneous steps. A student must be capable of giving 
the complete answer At Any step* This capability will guarantee that gifted 
student* who can solve the problem in their heads will not he forced to give 
a full atep-by-sfcep solution of a problem* Likewise a student who makes a 

afc* 

false attempt at solving a problem must be allowed to backtrack and attempt: 
another path to a solution. 

We wish to distinguish between false attempts and incorrect attempts* In 
^ x^ dx, 

the subset Cut ton y = x is bad even though the result of this subs tit-u t ion t ±* 
r i -2/5 , 

J“ y d ? 

2 

is given correctly* Eft the same problem the substitution y = x is incorrect 
when it results in 
hi 




The SARGE program meet* the shove requirements fuirly w«U. Se5ow we 
shall describe the manner in which one CCmmunie*t*s a solution to SARGE. 

Wo will also indicate how the system operates. Later wo shall criticise 
the present version of the system and present suggestions for its improvement. 

Comiaunlca ting with SARGE 

W& will present the syntax of the language that the atudent mu*t use to 
communicate a solution to SARGE through an interaction between an imaginary 
student and the program, (see figure 1), Let us suppose that the problem 
to be integrated is 

J(s 4 1) (sc + 2 } dx. 

This problem would be typed by SARGE as f o l lows j 
0 PROBLEM 1MT (K + 1) (l 4 2) DX 

The above line is representative of the statements in SARGE f a language. All 
statements in the language have the following forme 

Line~Fiumbrif command (command argument) (expression) 

(The latter two component* of a statement hove been parenthesised to indicate 
that they need not be present in each Statement.) 

The f Lr*t component of an input line is an unsigned integer which is 
rehen 4S * line number. Line numbers sre chosen by the student, except 
that the problem, is always presented as Line 0. Line numbers should bit 
distinct, end are used to identify steps in a solution. 

Comoands are used to indicate h&w the expression Arose in the solution 
process. The current set of CCumindi is PROBLEM;, 5UU5T, TKAN5F* Atf$ , 

PARTOF i and RETURprTO. The FK0BLEK command ie used Only in defining a new 
problem* and may not be used later on in the solution. The PROBLEM conmiar.1 
has no command argument. Thus Che next component of the line is taken tn 



be an expression * Line 0 indicates some interes ting features of the 
expression syntax. For example, concatenation implies multiplication except 
when num.bers are involved. This is the standard mathematical convention, 
and it forces the convention that all variables are single characters, 
fourth ermore f an attempt was made to keep the usual syntax of the indefinite 
integral. Thus 



3n line 1 the command used is TRANSf, 

1 transf int x$Z + 3x +2 dx 

This command is used when the expression is derived from the previous Line 
by an equivalence preserving transformat ion, Line I is derived from lino 0 
by an expansion. Note that no justification need be given for the 
equiva lence, 

In line 1 we note that the dollar sign signifies exponentiation. The 
convention regarding the dollar sign is that if the exponent or base are 
single syntactic units (unsigned numbers or alphabetic characters) then no 
parentheses are required. If more complex bases Or exponents ate used 
{e,g,, x+l), then parentheses must be used* This commonly-used convention 
applies also to the slash which signifies division. 

The first statement which has the line (lumber 2 is a t rang formation 
which involves certain identities regarding the Integration operator. 


2 Trans £ lot jt$2dx 4- int 2x dx + int 2dx 


SARGE is awni'K of these linearity identities and moreover recognizes 
an error in transcription. This is noted by SARlGE 1 s next response- 


AN ERROR OCCURRED AFTER LINE 1 WAS ANALYZED 


An examination of figure t will indicate that 5AEGE f s response up to 
now was a (lumbar which equaled the Hue number of the previous statement. 
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This is called the alow [node of In terac tlOcw In the faaC mode,, SARGE will 
make no responses unless an error octufSx The normal mode ts slew. A ch-inga 
In mode can be made by executing SlUJW NIL oe FAST NIL prior to working on a 
problem. FAST NIL will cause the mode to change to fast. 

After Indicating that an error occurred the system anticipates a new 
set of key words * The keyword 'help' which is used in this case causes 
SARGE to examine a list which contains probable explanations for the error. 

Cleatly the help mechanism should be used Only as a 3ast resort hy the 
student. Frequently SARGE has no reasonable idea to transmit to the user» 
in which case the response is 

SORRY, NO IDEA WHY ERROR OCCURRED. 

In the present situation if oan Only offer the Student a naeans for correcting 
his last step. It therefore types 

THE DIFFERENCE BETWEEN THE EXPECTED EXPRESSION AND YOURS IS 

X 

Other keywords which are recognised following; an error are 'quit' and 
’wal 1 . The quit mechanism allots the user to Stop working on the currant 
problem. The £U£l mechanism t used during the da hugging of 5ARGE, a Hows One ta 
execute any LISP function available in the ayatern. Any line which follows 
an error and which doea not begin with these keywords is Ignored. 

The aecond occurrence of line 2 indicates that the student 
2 tranaf int Zdx + int i; dx + int 2dx 

has accepted the advice offer red by BARGE, In line 3 the user is preparing to tackle 
a aubprob Ism of line 2. Ho identifies the part of the expression he wants 
to work on ^axt with the PARTQ-F command. The command argucrient here is s lino 
numbe r. 


3 


par tof 2 int x$2dx 



In line 4 the user is attempting to make a substitution on the 
subproblem selected! in the previous line, the command 
4 subs t y«$2 , int y dy 

argument is an equation relating the new variable to the old one^ Ttua 

operation is restricted to having only one occurrence of the new variable 

('y' in this case), For syntactic purposes a comma must follow this equation 

and precede the expression of the integral* As it turns out the result of 

the substitution is erroneous. SAKGK now prints its standard error comment. 

A call for help at this point yields a Very appropriate reply. 

IN YOUR SUBSTITUTION YOU FORGOT TO DIVIDE 
EY THE DERIVATIVE OF THE TRANSFORMATION 

The second line 4 shows a correction 

4 sub, y=s42 , int 0,5 ySQ-5 dy 

of this error, (tfe note a new syntactic device *- a comma following one or 
more of the left-most characters in a command can be used to abbreviate the 
command name. 

Line 5 was included here ss an example of the RETUftNTQ mechanism. This 
mechanism allows a student who is frustrated in his current attempt at a 
solution to continue from a previous line. 

Otir student is determined to solve this problem in line 3 (which 
admittedly is a trivial problem) by a substitution. This analysis explains 
the first line 6. 

t 4ubst y=xS(l/2), int y? 

The qu$Scion-mark is a CTB5 device for deleting the input line. Actually 
an empty line is transmitted to EAftGE and Is ignored. In some situations a 
student may wish to stop processing of a solution in A more drastic fashion. 
This would occur if die student typed a number of lines in fast mode and 


recognised an error in them before the S jfS tern had a ctrance to analyze the 
aointion* An interrupt from the console would cause an error h and the message 
from &ARGE will indicate the extent to which it analysed the solution. 

The second line 6 shows chat our poor student has finally seen the 
light end recognised the solution of the subproblem, 

6 ens 3 x$3/3 

Ihe. Ah'S command has as its command argument the line number of the 3 -yb-problem. 
whoso Integral appears as the expression- 

He are now approaching the end of the interaction- In lino 7 we go 
bach to line 2 to pick up another suhproblem. 

7 partof 2 3 in t x dx 

Line & contains the integral of that subproblem, 

& ens 7 3/2 k$2 

Line 9 eleirte to be the solution to the original problem (line 0), which it 
Is indeed, Hote that *lina{n) 1 cay&es the expression of lins n tfr be Sub¬ 
stituted in Ita posit Left in the expression. 

He fiive in figures 2 and 3 two further examples of interactions with 
SARGE- Wo new linguistic features are introduced except for the use of 
trigonometric functions in the obvioua manner* 


How ac-es &ARGE work 7 


From a certain point of view the S-AftcE program fa quite trivial. It took 
u-S only two weeks to got most Of the system running and in partially da bug. ted 
shape. However the 4 U« o£ the program ti quite large (about ?0-00 compiled 
lost rue t tons and an equivalent o£ 3000 interpreted ones in the usual 32 K 
LISP 1+3 System On CTSS)► As we indicated earlier we intended to rely on 
e«u large store of available 115P programs in the area uf algebraic 
manipulation* The most important of these programs, for our purposes, was 
William Martin's equivalence matching routines (see 1, Chapter A). This 
program determines whether two expression* ere equivalent by a very clever use 
of haah coding. Martin's program is deadly in recogniting equivalence of 
rational functions or rational function of trigonometric functions. For 
examplep this program allows SAUCE to recognise the equivalence of 1/2 sln^x) 
and s£n(x) cos(x) in figure 2 . This routine is currently weak in recognising 
equivalence of expressions involving square-roots + One could, however, design 
special routines which would recognise such situations and overcome them in 
many instances. Should the matching routine fail to recognise an equivalence;, 
then SAJMJE would claim that an error was made when, in fact, none was. 

Currently there exists no machinery which, allows the uSet to argue against 
the decisions of eA3GEh Though errors of this nature occur very rarely, ft 
would be useful Co give the user the capability of overcoming the deciaions 
of the system- 

Another one of Kartin 1 a programs fs used for parsing the input expres*ion 
into prefix notation. The input line is read, character by character! by one 
of our programs which trsnS forms the expression somewhat before giving it to 
Martin 1 s parsing routine (e.g., special attention must be paid to the integral 
sign syntax). The result of the parse of 


int £x+l d)t 


(integral (plus (times 2 x) 1) x] . 

If the expression is a sum of integrals then it trans formed into an integral 
of a sum, Ker.ce line 2 of figure 1 is stored internally exactly as line 1 is. 
This transformation simplifies the coding. It ahoyId be noted that the svstem 
currently requites that an expression reduce either to a aImp1e integral or to 
an express ion involving no integrals , Hence inteseattop-by-part# is an illegal 
transformation, The changes necessary to overcome this restriction can be 
easily mud® 4 

It should be clear that, in addition to the routines mentioned above, we 
require a symbolic differentiation program (used, for example, in dotoTraining 
tilt Correctness of a proposed, answer), a simplification program (an important 
component of the substitution checking mechanism), and a '’solve 11 program 

(also needed in substitutions). These routines were stolen from oyr work On 

► 

SIN(2), the infamous symbolic integrator. For the operation of theae routines, 
we required the ECHATCHEN pattern matching program. The SCHATCHEU language 
allows one to express concepts Like "a quadratic in k 11 quite concisely. We 
have not made much use of S CHATCtlEt-J in the current BARGE t v& t«m, although we 
expect that an improved version will find its presence quite useful. 

The main routine in SASCE, also called BARGE, is a fairly straight¬ 
forward interpreter of the input lines. Whenever an error occurs a check is 
made to determine whether any rules exist about how to treat the input line. 

If rules exist and the input line fits the expected error condition, a note 
la made In a location called s heLp\ otherwise ’help 1 is unchanged, 

The main program is executed interpretively during the operation of flic 
system because there is no space available in which to compile it and because 
it changes so drastically. The three Interactions described in figures 1 3 


tie It require about twenty 5 ecocide of execution time acid a similar amount of 
swapping tine, Hence the answer to the question posed in the beginning of 
this section is -- SARGE works slowly. It is Likely that a three-fold 
improvement in speed ton Id be had if the entire system were compiled. 

There are* at present, two modes ip which a problem is proposed to the 
user. In the slave mode the program proposes one of a set of built-in 
problems. This mode is activated by typing 'SIAVE MIL'. In the master mode, 
the user decides the problem on which he wants Co work. Ihis mode is Activated 
by typing "MASTER MIL". In either case, control reverts to LISP following an 


interac tion 


Critique and Future Directions 


below we shall consider sraae criticisms of the present SARGE system, 

1) The Syntax Of SARGE is One dimensional. It should he two djmen3 iurat - 
The use of a typewriter as the input medium hinders a natural interaction with 
the computer, A RASP Tablet or &<jme similar device is clearly called for., 
programs which perse mathe™tica 1 expressions written on a JiAHP Tablet have 
been designed by Andersen (3) andt Martin (4). Such programs should be 
operational soon. However, there are some difficulties with the use of a 
RAHD Tablet for this application. The typewriter input forces on the user a 
line-by-line format, Given the freedom possible with a RAHD Tablet t the user 
ia Likely to generate much input which does not conform to SARGE 1 s syntax 
(e,g, ,, side calculations for the determination of derivative?, doodles, etc,). 
This will make the parsing problem a good deal more temp lies ted- Furthermore 
a user station which include? a scope and a RAND Tablet is a good 
deal more expensive and consumes more computer time than a station which 
simply has a typewriter cenacle. 

1) The system ia too inefficient for practical use by a large body of 
students. This is principally due to the scavenger approach we adopted ip 
designing the System► A specially designed version should run substantially 
fas ter, 

3) The error diagnosing capability of the ft VS t Vim is n&t very powerful. 
What appears to be necessary here 1* a jophfst leated analysis oi the mis Cakes 
that students frequently make in integration problems. Such research cbuld 
proceed in several stages. At one stage one might determine a set of commonly 
made errors (such as the error in Substitution of figures 1 and 3), These 
errors would be known CO the program and it would determine when (Ugh errors 
account for the input expression. In a second step, one would analyze ihe 
interactions students have with such a program. This analysis could be used 


to improve the diagnostic capability. Finally one would want the program to 
build models of the behavior of each student so that the problems which arc 
posed and the diagnosis given by the system becomes individualized. These 
are non-trivial objectives which face many difficulties. One obvious source 
of difficulty is the intelligent student who will try to "progran’ the system 
in order to see what it will do under certain situations. The system should 
bo clover enough to recognise when it is being 'hacked' r 

Hie Current error comments are net at all subtle. It would be 
preferable If one could proceed In stages by using hints at first. If a stripe 
were available to the program, then such hints could be built into the 
script for each problem. We have tried to avoid the script-based approach 
as much aa possible. We have depended on the knowledge that the program has 
of the material to help it in diagnoses. Such knowledge is usually lacking in 
script-based teaching systems* However, one way Of extending the present 
system would involve the use of scripts* For example, one could include in 
a script several methods of golotion and several common ml*takes made cm 
each particular problem. In order to keep the dye tom as flexible at possible 
one would have to determine how far the student has progressed in following a 
given solution method, and how to return to such built-in solutions from the 
solution developed by the Student* This problem, too, is hardly trivial. 

In sunmary, it would seem to im chat # marriage Of the script-based 
approach and the r knowledge-based 1 approach te preferable to either approach. 
However there are difficult problems in making. Such a marriage a viable 


propoaition. 
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computer 

computer 

user 

computer 

user 

slave nil 

0 PROBLEM LHT (K4i)(X42) DX 

0 

1 trs-tisf lot x$2 43x 4 2 dx 

l 

2 transf ieat x£2 dx + int 2x. dx + int2-dx 

computer 

AN ERROR OCCURRED AFTER LIKE 1 WAS ANALYZED 

user 

help 

computer 

computer 

computer 

user 

computet 

user 

c omput er 

user 

THE DIFFERENCE BETWEEN THE EFFECTED EXPRESSION AND YOURS IS 
X 

1 

2 teansf int x$2 dx 4 int 3* dx 4 int 2 dx 

2 

3 partof 2 int x$2 dx 

3 

4 aubst y=x$2, int y dy 

c ompu tor 

AM ERROR OCCURRED AFTER LIKE 3 WAS ANALYZED 

user 

help 

computer 

computer 

computet 

user 

compu ter 

user 

cor-pu te ¥ 
user 

user 
c ornpu t et 
user 

computet 

user 

computet 

user 

IH YDUS SUBSTITUTION YOU FORGOT TO DIVIDE &Y THE DERIVATIVE 
OF THE TRANSFORMATION 

3 

4 sub sty =x$2 f . int 0 * 5 y$ 0. 5 d y 

4 

5 ret *3 

5 

$ §ubst y =k$ (1/2) T int y? 

6 ana 3 x§3/3 

6 

7 partof 2 int 3 X dx 

7 

$ ana? l'5x$2 

B 

9 ans D line{&) + line(&) 42 x 

computer 

CORRECT 


Figure 1 
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0 PROBLEM ENT S IN (X)COS (X) DX 

0 

1 treesf int 0.5sin(2x)dx 

1 

2 true«£ 0+5 int tin (2x) dx 

2 

1 subst y^x, 1/2 int 1/2 sin(y) dy 

3 

4 ans3 (1/2)(1/2)(-cos(y}> 

4 

3 snsO -l/4cos(2x) ( sin(x)$2 + cos(x)$2 ) 
CORRECT 


Figure 2 
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fast nil 
NIL 

naatftt till 
0 FRO B iUEM 

int e${£x)/< l + e$(4ac)) <J* 

1 svb&t y^e$x, int y$2/(l+yf4> dy 


computer AN ERROR OCCURRED AFTER LINE 0 WAS ANALYZED 


user 

user 

user 

user 

user 

cCmvpLi ter 


ohyes 

1 aubat y=e$K, In t y/(L+y$4) dy 

2 a Lib, z=y$2 , int 0.5/(l+z:$2 ) da 

3 ana 2 1/2 aretan(a) 

4 ana 0 1/2 axetan ((c$x)£2.} 

CORRECT 


Figure 3 
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